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Abstract 

Motivated by the need of observers that are both robust to disturbances and guar¬ 
antee fast convergence to zero of the estimation error, we propose an observer for linear 
time-invariant systems with noisy output that consists of the combination of N coupled 
observers over a connectivity graph. At each node of the graph, the output of these 
interconnected observers is defined as the average of the estimates obtained using local 
information. The convergence rate and the robustness to measurement noise of the pro¬ 
posed observer’s output are characterized in terms of ICC bounds. Several optimization 
problems are formulated to design the proposed observer so as to satisfy a given rate of 
convergence specification while minimizing the gain from noise to estimates or the 
size of the connectivity graph. It is shown that that the interconnected observers relax 
the well-known tradeoff between rate of convergence and noise amplification, which is 
a property attributed to the proposed innovation term that, over the graph, couples 
the estimates between the individual observers. Sufficient conditions involving infor¬ 
mation of the plant only, assuring that the estimate obtained at each node of the graph 
outperforms the one obtained with a single, standard Luenberger observer are given. 
The results are illustrated in several examples throughout the paper. 


1 Introduction 

We consider linear time-invariant systems of the form 

x = Ax, y = Cx + m(t), (1) 

where x G M n , y E W p , and t K > m(t ) denotes measurement noise, for which there exists a 
Luenberger observer 

x L = Ax l - K L (y L - y), y L = Cx L (2) 
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supported by the National Science Foundation under CAREER Grant no. ECS-1150306 and by the Air 
Force Office of Scientific Research under Grant no. FA9550-12-1-0366. 
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leading to the exponentially stable error system 

e L = (A- K L C)e L + K L m(t) := A L e L + K L m(t ) (3) 

with estimation error given by := xl — x. It is well-known that, under observability 
conditions of ©, the matrix gain K L can be chosen to make the convergence rate of (131) 
arbitrarily fast. However, due to the fast convergence speed requiring a large gain, the price 
to pay is that the effect of measurement noise m is amplified. Indeed, the design of observers, 
such as those in the form (|2|) , involves a tradeoff between convergence rate and robustness 
to measurement noise EG] In fact, in [TJ page 597], D. G. Luenberger makes the following 
remark about the error system (jHJ) when (C, A) is observable: “Theoretically, the eigenvalues 
can be moved arbitrarily toward minus infinity, yielding extremely rapid convergence. This 
tends, however, to make the observer act like a differentiator and thereby become highly 
sensitive to noise, and to introduce other difficulties.” Along the same lines, the authors of 
|2] recognize the compromise between performance and robustness in the design of (J2J) : “At 
this point we can only offer some intuitive guidelines for a choice of K to obtain satisfactory 
performance of the observer. To obtain fast convergence of the reconstruction error to zero, 
K should be chosen so that the observer poles are quite deep in the left-half complex plane. 
This, however, generally must be achieved by making the gain matrix K large, which in turn 
makes the observer very sensitive to any observation noise that may be present, added to 
the observed variable y(t). A compromise must be found,” see [21 page 332], Unfortunately, 
this issue is also at the core of every estimation algorithm for multi-agent systems. 

1.1 Related work 

Several observer architectures and design methods with the goal of conferring good perfor¬ 
mance and robustness to the error system have been proposed in the literature. In particular, 
Hoq tools have been employed to formulate sets of Linear Matrix Inequalities (LMIs) that, 
when feasible, guarantee that the C 2 gain from disturbance to the estimation error is below 
a pre-established upper bound; see, e.g., [31 SUSIE], to just list a few. Following ideas from 
adaptive control HUB], observers with a gain that adapts to the plant measurements have 
been proposed in [HI SO], though the presence of measurement noise can lead to large values 
of the gains. Such issues also emerge in the design of high-gain observers, where the use 
of high gain can significantly amplify the effect of measurement noise. Indeed, in [111 S2j, it 
is shown that measurement noise introduces an upper limit for the gain of a high-gain ob¬ 
server with constant gain when good performance is desired. More recently, observers using 
essentially two set of gains, one set optimized for convergence and the other for robustness, 
have been found successful in certain settings. Such approaches include the piecewise-linear 
gain approach in [13] for simultaneously satisfying steady-state and transient bounds, the 
high gain observer with nonlinear adaptive gain in [TJ], and the high gain observer with 
on-line gain tuning in PE]- Also recently, a switching algorithm combining two observers for 
performance improvement was proposed in [15]. 

Recent research efforts in multi-agent systems have lead to enlightening results in dis¬ 
tributed estimation and consensus. Distributed Kalman filtering are employed for achieving 
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spatially-distributed estimation tasks in HU and for sensor networks in [IH1HH1 [2Q1 EH |22]. 
To characterize the effect of unmodeled dynamics on the consensus multi-agent system, in 
[23], a region-based approach is used for distributed TUo-based consensus of multi-agent 
systems with an undirected graph. For dynamic average consensus, [SM] proposes a decen¬ 
tralized algorithm that guarantees asymptotic agreement of a signal over strongly connected 
and weight-balanced graphs. In [25], switching inter-agent topologies are used to design 
distributed observers for a leader-follower problem in multi-agent systems. For estimating 
the trajectory of a moving target with perturbed dynamics, nonlinear filters based on net¬ 
worked sensors are proposed in [26, 27]. However, distributed estimation algorithms that 
systematically meet specifications of performance and robustness to measurement noise are 
not available. 

1.2 Contributions 

We propose a new observer, called interconnected observers, with improved convergence 
rate of the estimation error and robustness to measurement noise, when compared with the 
observer in ([2]). The proposed observer consists of N linear time-invariant observers intercon¬ 
nected over a graph. The local estimate at each node is provided by an observer featuring an 
innovation term that appropriately injects the estimate obtained from its neighbors, which 
can be computed in a decentralized manner. The global estimate of the state of the plant is 
given by the average of the local estimates. 

The main contributions of this paper are threefold. 

1) We establish that, under certain conditions involving its parameters, and when com¬ 
pared to the Luenberger observer in ([2]), the proposed observer significantly improves 
the rate of convergence and the gain from measurement noise to estimation error, with 
improvements of more than 50% at times (see Table EJ). 

2) We characterize the convergence rate and the robustness to measurement noise of the 
proposed observer in terms of ICC bounds, which provide useful information on how 
the parameters of the observers affect such properties. 

3) We formulate optimization problems for the purpose of the design of interconnected 
observers. 

i) For a fixed rate of convergence, optimization problems are proposed for the de¬ 
sign of interconnected observers with optimized gain from measurement noise to 
estimation error (local and global). 

ii) For a fixed rate of convergence and a desired gain, optimization problems 
that minimize the number of edges of the connectivity graph are also formulated. 
Using appropriate coordinates and conditions, we show that these problems can 
be converted into convex optimization problems that can be used to efficiently 
design interconnected observers. 
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iii) An LMI condition involving only information about the plant is provided to guar¬ 
antee that the estimate obtained at each node of the graph outperforms the one 
obtained with a single, standard Luenberger observer, which uniquely relaxes the 
well-known trade off between rate of convergence and noise amplification. 

Examples throughout the paper illustrate our results and their applicability to estimation in 
multi-agent systems, such as mobile and sensor networks. To the best of our knowledge, we 
are not aware of an observer in the literature that guarantees such properties simultaneously. 

1.3 Organization of the Paper 

The remainder of this paper is organized as follows. In Section [21 the idea and benefits behind 
interconnected observers are presented in a motivational example. Section [3] introduces the 
proposed observer in general form, the ICC bounds, and the design methods in terms of 
optimization problems. Section [4] discusses the optimization of the number of nodes in the 
graph, a decentralized method to compute a global estimate, and a relationship with the 
optimal observer. 

2 Motivational Example 

Consider the scalar plant 


x = ax, y = x + m, (4) 

where m denotes measurement noise and a < 0. Suppose we want to estimate the state x 
from measurements of y. Following (J2]), a Luenberger observer for (J4J) is given by 

x L = ax L - K L (y L -y), y L = x L . (5) 

The resulting estimation error system is given by ([3]) with Al = a — Kl- Us rate of con¬ 
vergence is a — Kl and, when m is constant, its steady-state error is e* L := It is 

apparent that to get fast convergence rate, the constant K L needs to be positive and large. 
However, as argued in the introduction, with Kl large, the effect of measurement noise is 
amplified. In light of recent popularity of multi-agent systems, it is natural to explore the 
advantages of using more than one measurement of the plant’s output so as to overcome 
such a tradeoff. 

In this paper, we show that it is possible to design interconnected observers that are 
capable of relaxing the said tradeoff. More precisely, interconnected observers are proposed 
to improve the rate of convergence and the robustness to measurement noise, when compared 
to a single Luenberger observer. To illustrate the idea behind the proposed observer, consider 
the estimation of the state of the scalar plant (J3[) with two agents, each taking its own 
measurement of y. The two agents can communicate with each other according to the 
following directed graph: agent 1 can transmit information to agent 2, but agent 2 cannot 
send data to agent 1. This is shown in Figure [U 
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Xi — X\ x 2 = |(xi + x 2 ) 

Figure 1: Two agents connected as a direct graph. 


Following the approach in this paper, an interconnected observer would take the form 

x, = ax ± - Kn(yi - y x ), x 2 = ax 2 - K 22 (y 2 - y 2 ) - K 2 i{yi - yi), 

„ „ „ „ _ „ _ x\ + x 2 ( 6 ) 

Ui=Xi, V2 = x 2 , x 1 = x 1 , x 2 = --—, 

where Xi and Xi are associated with agent i, each measured plant output y t is corrupted by 
measurement noise m*, that is y\ = x + mi and y 2 — x + m 2 , respectively, where mbs are 
independent. The term K 2 \(yi—yi) v defines an innovation term exploiting the information 
shared by agent 1 with agent 2. The output x^ of agent i defines the local estimate (at agent 
i) of x. Since agent 1 only has access to its own information, we have x\ — X\, while since 
agent 2 has also information from its neighbor, agent 2’s output x 2 can be taken as the 
average of the states x\ and X 2 0 

To analyze the estimation error induced by the interconnected observer in (l6l) . define 
error variables e* := Xj — x, % 6 {1, 2}. Then, the error system is given by 

ei = (a - K 11 )e 1 + K u mi, e 2 = —K 21 e 1 + (a - K 22 )e 2 + K 2l mi + K 22 m 2 , (7) 

which can be written in matrix form as 


e = Ae + Km, 

where e=[e\ e 2 ] r , m = [mi m 2 ] T , 


a — Ku 

O 

, K = 

' Ku 

l 

O 

I 

_ l 

a — K 22 

_ K 2 i 

K 22 J 


( 8 ) 

( 9 ) 


Then, when K n , K 2 \, and K 22 are chosen such that A is Hurwitz and when m is constant, 
the steady-state value of (JHD is given by 


et = 


K 


n 


K 


-mi, 


en = 


-aK 


21 


11 


(An — a) (A 22 — a) 


mi + 


K 


22 


AT- 


-m 2 . 


22 


( 10 ) 


Furthermore, the local estimation error resulting from each agent is given by the quantity 
e % := Xi — x, i 6 {1, 2}, and has a steady-state value given by 


el = e^, 


_* Kn(K 22 — a) — aK 2 i 
e 2 = -rr t? -r' m i + 


AT. 


22 


2( A n —a) (A 22 — o) 2(K 22 — a) 


m 2 . 


1 In general, x 2 could be the convex combination of x\ and x 2 , i.e., x 2 = siii + s 2 x 2 , S 1 + S 2 = l,si,S 2 G 
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Let K u = K 2 2 = K l . Because of the structure of A, it can be verified that the rate 
of convergence for the estimation error (181) is a — K^, which is the same as that of the 
Luenberger observer ([5]). Moreover, assuming that constant noise mi and m 2 are equal, i.e., 
mi = m 2 = mo, then 


_* _ 2K l (K l - a) - a,K 2l 
62 _ 2 (K l - ay- m ° 


Interestingly, picking K 2 1 — 2f '' 2 ' , we obtain e£ — 0 for any unknown constant mo, 
namely, the measurement noise can be completely rejected. When constant noise mi and 
m 2 are not equal, the choice K 2 1 = leads to e 2 = ,-, ( , / ^ L _^ m 2 , which is a significant 

improvement (50%) over the case that agent 2 only has access to its own measurement (in 
which case e 2 = A ^i, t m 2 ). These properties cannot be achieved by using the Luenberger 
observer in (J5j). 

For general measurement noises mi and m 2 (not necessarily constant), the H^ norrrj§ 
from noise to the estimation error can be employed to study the noise effect. As shown in 
Figure 2(b), when A" 21 ps —4.75, the H 00 gain from noise m to the local estimate e 2 achieves 
a minimum equal to 0.45, which is much smaller than that of the Luenberger observer in 
(J5]), which is 0.8, with equal rate of convergence (A% = 2, a = —0.5). 




(a) Hoo norm from noise m to estimation error cl (b) The local TAo norm from noise m to estimation 
with respect to the parameter Kl. error e 2 with respect to the parameter K 2 1 - 

Figure 2: Comparison between the norms for the proposed observer and the Luenberger 
observer, with fixed parameters ATn = K 22 = 2 and a = —0.5 (improved by approximately 
43.8%). 

ft is important to point out that the observer proposed in this paper will also outperform 
the Luenberger observer in ([5]) when, in addition, agent 2 can transmit information to agent 

2 By a H oa norm” we mean the C 2 gain from m to e, which is the induced 2-norm of the complex matrix 
transfer function from m to e. 
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1, i.e., the link between the two agents is bidirectional. Such an improvement is unique for 
the following two reasons. When the two agents are connected by a bidirectional link, our 
observer can be considered to be a bank of two observers providing a global estimate that 
averages the estimate of each individual observer. When the innovation terms i^ 2 i(^i — 
i/i)” and “—^ 12(^2 — y 2 )” are missing, it can be shown that the effect of noise in the global 
estimate cannot be reduced - bank of observers currently available in the literature suffer 
from this shortcoming (see Appendix [D] for a proof of this claim). This suggests that the 
innovation terms in our interconnected observer are key. The second reason stems from 
the fact that our observer can be viewed as an “augmented-dimension observer” since, in 
general, it would have dimension Nn for a plant of dimension n. This property would 
contradict the well-known fact that an observer in the form ([5]) (or, in general, of the form 
(|2|) ) minimizing the mean square estimation error under perturbations has necessarily the 
same dimension as the plant (see, e.g., [21 Section 4.2, Definition 4.3, and Theorem 4.5] and 
Section 14.3ft . However, when performance specifications (relative to the optimal observer) 
are added, which, in this paper, are formulated in terms of eigenvalue constraints, an n- 
dimensional observer may not be optimal. The augmented dimension (larger than the plant) 
is the key feature that enables our observer to outperform observers of the form (]5j), in 
particular, by mitigating the typical amplification of noise due to large gain required to 
speed up convergence. 

As we show next, the idea behind the proposed interconnected observer illustrated in the 
example above generalizes to the case where N agents can measure the plant’s output and 
share information over a graph. 

3 Interconnected Observers 

3.1 Notation and basic definitions 

Given a matrix A with Jordan form A = XJX~ 1 , ot(A) := max{Pe(A) : A G eig(A)}, 
where eig(A) denotes the eigenspace of A; jj,(A) := max{Pe(A)/2 : A G eig(A + A T )}; 
\A\ := max{|A|^ : A G eig(A T A)}; k(A) := min{|V||V -1 | : A = XJX _1 }; A is dissipative 
if A + A T < 0. Given a vector u G M n , |u| := Vu T u. Given a Lebesgue measurable 
function t 1 —> G(t), the norm ||G||i is defined by ||G||i := J 0 °° \\G(t)\\dt, where ||G(t)|| = 
sup{|G(t)w| : u G M"and|it| < 1} for all t > 0. Given a function m : M>o —» M n , 
|m|oo := sup t>0 \m(t)\. Given a function v : M> 0 — > M, D + v(t ) := limsup fa _ ) . 0+ A t + h )-A t ) i _ 
The set of complex numbers is denoted by C. The set of natural numbers is denoted by 
N := {1,2,3, •••}. Given a symmetric matrix P, A max (P) := max{A : A G eig(P)} and 
A min (P) := min{A : A G eig(P)}. For a continuous transfer function C9 s4 T(s) G C, 
the Poo norm is defined as ||T||oo = su P^gR l|l"(i a; )l|) T is called stable if all its poles have 
negative real part, the dominant pole for a stable transfer function is the pole with largest real 
part, the rate of convergence of a closed-loop system with stable transfer function is defined 
by the absolute value of real part of the dominant pole. Given a function / : [0, 00 ) — > M, / 
is square integrable if f Q f(t)dt exists and is finite. For a continuous differentiable function 
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f{x) el, / is pseudo-convex if for any x\, x 2 E K such that V/(xi) t (x 2 — X\) > 0, 
then f{x 2 ) > f(xi); furthermore, / is pseudo-concave if —/ is pseudo-convex. Given 
matrices A, 5 with proper dimensions, we define the operator He(A, B) := A T i? + B T A ; 
A® B defines the Kronecker product; and A* B defines the Khatri-Rao product. Given 
N E N, In E M. NxN defines the identity matrix, In is the vector of N ones, and 11^ := 
In ~ Given a set S, the function card(S') defines the cardinality of the set S. A 

function a : M> 0 —» M> 0 is a class-ZC^ function, also written a E /C Qc , if a is zero at zero, 
continuous, strictly increasing, and unbounded. A function (3 : M> 0 x M> 0 —> M>o is a class- 
fCC function, also written (3 E ICC, if it is nondecreasing in its first argument, nonincreasing 
in its second argument, lim r _> 0 + (3(r, s) = 0 for each s E M>o, and lim^oo /3(r, s) = 0 for each 
r E M> 0 . 

3.2 Preliminaries on graph theory 

A directed graph (digraph) is defined as V = (V,£,G). The set of nodes of the digraph 
are indexed by the elements of V = {1,2,..., A^}, and the edges are the pairs in the set 
£ C V x V. Each edge directly links two nodes, i.e., an edge from i to j, denoted by (i,j), 
implies that agent % can send information to agent j. The adjacency matrix of the digraph T 
is denoted by G = (gij) E R jVx7V , where g lJ = 1 if (i,j) E £, and g tJ = 0 otherwise. A digraph 
is undirected if g VJ = for all i,j E V. The in-degree and out-degree of agent i are defined 
by d m (i ) = Ylj=i9ji an d d out (i ) = Yhj=i9ii- ^ digraph is weight-balanced if, for each node 
i EV , the in-degree equals its out-degree. The in-degree matrix D is the diagonal matrix 
with entries Da = d in (i), for all i E V. The Laplacian matrix of the graph T, denoted by C, 
is defined as C = D — G. The Laplacian has the property that CIn = 0. Therefore, 0 is an 
eigenvalue of the matrix C. Furthermore, the rest of the eigenvalues of C have nonnegative 
real parts. Denote by AR the set containing all edges that connected to the i-th agent, i.e., 
Mi := {( j,i ) : j E V, (j,i) E £}. The set of indices corresponding to the neighbors that can 
send information to the i-tli agent is denoted by T{i) := {j E V : ( j,i ) E £}. 


3.3 Observer structure and basic properties 

The general form of the proposed observer consists of N interconnected observers with output 
given by the average over a graph of the states of the individual observers]! Specifically, 
consider a network of N agents defined by a graph T = (V,£,G). For the estimation of the 
plant’s state, a local state observer using information from its neighbors is attached to each 
agent. More precisely, for each i 6 V, the agent i runs a local state observer given by 


Xi = 


Axi - ^ Kij{yj -yj), 

jex(i) 


lh = Cxi , Xi = 


E 


Xi 


card(X(i)) ^ ■> 


( 11 ) 


3 More general linear combinations defining Xi are possible, i.e., Xi = ^2j e x(i) Vj&j with r/ ? £ R for all j 
and Ejex(i) Vj = !■ 




where x, denotes the state variable of the observer, x, is the local estimate of the plant’s state 
x, and yi denotes the measurement of y in (OQ) taken by the i-th agent under measurement 
noise rrp, that is ?/* = Cx + rrii. The information that the i-th agent obtains from its neighbors 
are the values of x/s and yf s for each j E I(i). The collection of local state observers in 
di connected via the graph T defines the proposed interconnected observer. 

To analyze the properties of interconnected observers, define for each i E V, e* := x, — x 
and the associated vector e := (ei,..., e#)- Furthermore, dehne the local estimation error 
hj := Xj — x, the global estimation error vector e := (ei,..., e^v), and the noise vector 
m := (mi,..., mjv)- Then, it follows that 


e* h/ ^ ^ KijCej T ^ ^ F 
jgxp) jez(i) 


ijTUji 


e-i = 


E 


card(X(z)) ^ ' r 


( 12 ) 


which can be rewritten in the compact form 

e={I N ®A-(lC*G T )(I N ®C))e+(lC*G T )m, e = p- 1 ®/ n )(G T (8)/ n )e, (13) 


where G is the adjacency matrix, D is the in-degree matrix, 



K u 

K\2 

• K 1N 

k 21 

K‘22 

K 2 n 

Km 

Kn2 ■ ■ 

■ k nn 


(14) 


and the Khatri-Rao product /C * G T is such that /C is treated as N x N block matrices with 
Kij 's as blocks. Dehne 

A:=I n ®A- ()C*G t )(I n ®C), B:=JC*G t , C := (D~ 1 (8)/ n )(G T < 8 >/ n ). (15) 

Then, the transfer function from measurement noise m to error e is given by T(s) = C(sl — 
A) l B. For the purpose of designing the proposed interconnected observer, each agent is 
self-connected, i.e., (i,i) E £. Therefore, we have tr (D) > N. 


Remark 3.1 The matrix In ® A is a block diagonal matrix with matrix A in each of the 
N diagonal blocks (of dimension n x n). The matrix /C * G T defines the gain matrix for the 
graph, while (D -1 <g) I n )(G T <g) I n ) generates the estimation matrix for each agent by averaging 
the local estimates obtained from its neighbors. 


It can be verified that, under a detectability condition, interconnected observers can be 
designed so that the origin of the error system in (fT3l) is (exponentially) stable. 

Proposition 3.2 For the plant © with measurement noise rrii = 0 for each agent i, if 
the pair {A, C) is detectable, then, for any N E N, there exists a digraph V with adjacency 
matrix G and a gain JC such that the matrix A is Hurwitz and the resulting system (1T3]) has 
its origin exponentially stable. 
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Proof For any N E N, consider G = In- Then it follows that e* = (A — K ii C)e i for each 
i E V. Under the assumption that the pair (A, C ) is detectable, immediately we know that, 
for each i E V, there exists Ka such that A — K^C is Hurwitz. Therefore, the resulting A is 
Hurwitz. ■ 


3.4 ICC characterization of performance and robustness 

In this section, the performance and robustness properties of observers are characterized 
in terms of fCC bounds. More precisely, given an observer with estimation error e, we are 
interested in bounds of the form 


|e(t)| < /3(|e(0)|,t) + </?(|m|oo) Wt > 0, 

where 1 H > e(t) is a solution to the error system, (3 is a class-/C£ function, and p> is a class-ZC^ 
function. To establish and compare this property with that of the interconnected observers, 
the next result characterizes such bounds for the proposed observer so that it can be designed 
to outperform those due to a Luenberger observers. 


Proposition 3.3 For the plant ©. assume the pair (4, C ) is detectable. Let N eN and a 
digraph V = (V, £, G ) be given. If there exists a gain 1C such that at least one of the following 
conditions are satisfied: 


1) The matrix A is Hurwitz with distinct eigenvalues; 

2) The matrix A is dissipative, i.e., for some a > 0, A T + A < —2 al; 

3) There exists P = P T > 0 such that He(M, P) < —2 aP for some a > 0/ 


then, there exist a class-lCC function ft : M > 0 x M > 0 —> M > 0 and a class-K function p : M > 0 —> 
M>o such that the solution e of (1131) from any e(0) E M. nN satisfies 


|e(t)| </3(|e(0)|,f) + ^(|m|oo) Vf G M> 0 . (16) 

In particular, the functions (3 and p can be chosen, for all s,t > 0, as follows: if 1) 
holds, then, /3(s,t ) = K(A)|C|exp(a(M)t)s, </?(s) = if 2) holds, then, (3(s,t) = 

\C\ exp (fi(A)t)s, (p(s) = if 3) holds, then, j3(s,t ) = y/cf\C\ exp(-A t)s, ip{s) = Cp^jjps, 


with A = 

AmaxfU ) 


and c v = \ 

* A 


Amax(-P) 


,(P)' 


Proof The proof can be found in Appendix [Aj ■ 

Proposition 13.31 provides a way to design parameters for the interconnected observer as 
follows. Recall that A L and K L are defined in (|3]). 


Theorem 3.4 For the plant (P) with the Luenberger observer (J2D and the interconnected 
observers CD, let N E N and a digraph T be given. If Kl is such that at least one of the 
following conditions are satisfied: 
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1) A L is Hurwitz with distinct eigenvalues, and there exists 1C such that a(A) < a(A L ) 
and k{A) < n(A L )-^~- 


\AAl)V 


2) A l is dissipative, and there exists 1C such that p(A) < /x(A L ) (or a(A) < a(A L ), 


respectively - see below c)) and 


\B\\C\ 


< 


\Kl\ 


HA\ ^ \/j,(A l )\’ 


3) Al satisfies H e(AL, Pl) < —2 (AlPl f° r some > 0 and Pl = Pj > 0, and there 
exists 1C such that 


3.1) item 3) of Proposition ^. 3i holds with a > 0, P = P T > 0, 


3.2) A := 

CpL 


«A min (P) 
Amax(-P) 
■^max (Pl) . 
Amin (Pl) ’ 


< 


-^min (Pl) = . 

Ama x(-Pl) 


Amax(i I) ) 


and 


then, there exist /3 G ICC and p G /C^ such that the solution e of (1T3T) from any e(0) G M nJV 
satisfies 


a) |e(t)| < /3(|e(0)|, t) + tp(\m\ 00 ) for all t > 0; 

b) Given nonzero e(0) and e^O), 3 1* > 0 such that /5(|e(0)|, t) < /3z,(|ex,(0)|, t) for all 
t > t*; 


c) tp(s) < <Pl(s), f° r all s 7 ^ 0, s G M> 0 . 

In particular, the functions (3 G ICC and ip G /Coo can be chosen accordingly as in Propo¬ 
sition E3 while (3 l G ICC and <pi G /Coo can be chosen, for all s,t > 0, as follows: 
if 1) holds, then /3 L (s,t) = n(A L )exp(a(A L )t)s, ^l(s) = k(A l ) s ; if 2) holds, then 

/ d L {s,t ) = exp (n(A L )t)s (or /3 L (s,t) = K{A L )exp(a{A L )t)s, respectively), <p L (s) = s; 

if 3) holds, then /3 L (s,t) = yc^exp(-A L t)s, <Pl{s) = c pL j^s. 


Proof The proof follows from Proposition ^. 31 Note that the Luenberger observer is a special 
case of the interconnected observer with N — 1. ■ 


Remark 3.5 Assumption 2) of Proposition [Q1 is a special case of assumption 3) with ma¬ 
trices P = I and Pq — I. Note that the boundedness property in item 2) in Theorem 3.f 


guarantees that the rate of convergence of the interconnected observers is faster than or equal 
to that of a Luenberger observer by comparing the ICC estimates they induce (which is a rea¬ 
sonable measure of performance when the ICC functions are derived using similar bounding 
techniques). 


The ICC bounds established in Proposition 13.31 characterize a worse case property of the 
estimation error of the proposed observer, which can be compared to that of a Luenberger 
observer via Theorem 13.41 The following example illustrates this point. 
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= X 2 = \{xi + x 2 ) 

Figure 3: Two agents connected as a direct graph. 


Example 3.6 We revisit the motivational example in Section [H and design an interconnected 
observer with N = 2 with an all-to-all graph as shown in Figured Consider the case when 
two agents are experiencing common noises mi = m 2 = m. The transfer functions from m 
to eL and from m to e (global) are given b$_\ Tl(s ) = s _ 1 ( )_f I<L and T(s ) = C(sl — A)~ 1 B. In 
particular, the proposed observer takes the form 

Xi = ax i - K u (yi - y) - K l2 {y 2 -y), x 2 = ax 2 - K 22 (y 2 - y) - K 2l (y l - y), 

„ ^ ^ _ _ x i + x 2 (1^) 

y i=xi, y 2 = x 2 , xi = x 2 = —-—. 


Then, we have the following result. 

Proposition 3.7 Given a,I±L E M such that a / 0 and a — K L < 0, then there exist 
K\\, K 22l K\ 2 , K 2l e M such that the rate of convergence of the observer (jTT]) is no smaller 
than that of the one induced by the Luenberger observer and the H^ norm of T is smaller 
than the H^ norm ofT L , i.e., HTH^ < ||T L ||oo- 


Proof The proof can be found in Appendix \B[ ■ 

It should be noted that averaging the estimates of two uncoupled single Luenberger observers 
(one at each agent) does not lead to both faster convergence rate and smaller steady state 
error (see AppendixWl). The error dynamics of ([6j) can be written as 


e = Ae + Km, A 


a — An — K 12 
—K 2 i a — K 22 


A n + K 12 
K 2 \ + K 22 


(18) 


where e = [e\ e 2 \ T . The steady-state value of (fT8|) is given by e* = [e\ e^] 1 ", where 

K\ i K' 2'2 — R12K21 — K\i a ~ Ai 2 a * K11K22 — K12K21 ~ K 2 2 a — R 2 i a 

p == _ >n p == _m 

1 K u K 22 - K\ 2 K 2 \ - K 22 a -K u a + a 2 ’ 2 K ,, K 22 - K V 2 K 2] - K 22 a - K t ,a + a 2 ' 

The estimation error of the proposed observer is given by the quantity 


e* := Xi — x = 


ei + e 2 
2 


*£{1,2}, 


(19) 


and has a steady-state value given by 


_* KuK 22 - K\ 2 K 2 \ - (1/2) {K u + A" 22 + K 12 + K 21 )a 

p — p — _m 

1 ' 2 K U K 22 - K 12 K 21 - K 22 a - Kna + a 2 

4 For the particular choice of parameters Ku = K 22 = Kl and K\ 2 = K 2 i = 0, H^Hoo = HT^Hoo. 


( 20 ) 
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Under the condition that the matrix A is Hurwitz, its eigenvalues can be written in the general 
form Ai 2 = — cr ± joj, where a is positive and oo G M. Then, referring to Theorem \3.f\ by 


comparing the ICC bounds, the following conditions guarantee a faster convergence rate of 
the proposed observer: 


— a = 


<K 


11 


(A: 


22 


< cl — Kl < 0, {{K u — a) + (K 22 — o)) <4 det A, (21) 


where det (A) = KuK 2 2 — Ki 2 K 2 i — K 22 a — Kua + a 2 . Furthermore, in order to assure an 
improvement on the effect of measurement noise, the steady-state values e* and e* L should 
satisfy |e*| < \e* L \, which leads to the following condition: 


( 22 ) 


Now, to perform a numerical comparison, we consider the case where a = —0.5 and m : 
M>o —* M is a continuous bounded function. A Luenberger observer is designed following (J5]) 
to achieve a convergence rate of 2.5 and an H^ gain from m to equal to 0.8, which leads 
to Kl = 2. For the interconnected observers m, using Theorem \3-4\ conditions 2) can be 
rewritten as 


A'nA'22 - K 12 K 21 - (1/2)(A'n + A'22 + A ' 12 + A' 2 i)a 


K l 

A'nA'22 - K 12 K 21 - K 22 a - Kua + a 2 


K L -a 


a(A) < a — K l , 


V 2 P(Ku + Kn ) 2 + (A ' 22 + K 21 y 
2 | 


< 


Kr 


(23) 


From solving (T23l) . we pick parameters Ku = 1.7896, A '22 = 2.2278, K V2 = 0.0538, K 2 i = 
— 1.1633. It can be verified that the eigenvalues of A according to this set of parameters are 
—2.5087 ± 0.12087. Moreover, p(A) = -1.9123. 

Now we perform simulations using these parameters and different measurement noises. 
With initial conditions x(0) = 3, 75(0) = 75(0) = 75,(0) = 5, the first simulation is ran 


for measurement noise m{t ) = 0 and the resulting trajectories are shown in Figure 4(a) 


This figure shows that the interconnected observers converge at a faster rate compared to 
the Luenberger observer. In fact, item 2) of Theorem \3-4\ holds with t* ~ 6.7s. Simulation 
results for m(t ) = 0.3 are shown in Figure 4( c ) The behavior of the interconnected observers 


with constant noise is similar to that of with zero noise. It is worth to note that there is 
an improvement of the steady-state error by the interconnected observers since e* = 0.2272, 
while the Luenberger observer gives e* L = 0.2400. As shown in Figure 4( c ) < around t ~ 
2s, e becomes closer to 0 than e5 thereafter. To further explore the performance of the 
interconnected observers, we also consider measurement noise with different frequencies, i.e., 
a low frequency noise m(t) = 0.3 + 0.3sin(20t) and a high frequency noise m(t ) = 0.3 + 
0.3sin(200t). In order to clearly determine the properties of the interconnected observers, 
the tails of the simulations are shown in Figure 4( e ) an d Figure 4(f)' respectively. The 
advantage of the interconnected observers lies on the properties of damped oscillatory behavior 
and smaller mean value of estimation error. Specifically, a numerical comparison of the 
estimation errors after transient is reported in the first two columns of Table{l 1 which confirm 
the improvements guaranteed by the interconnected observers. A 
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(a) Estimation errors (m(t) = 0) 
for Luenberger observer and the in¬ 
terconnected observers (N = 2). 



(b) Difference between estimation 
errors of Luenberger observer and 
that of the interconnected ob¬ 
servers (N = 2 and m(t) = 0). 



(c) Estimation errors (m(t) = 0.3) 
for Luenberger observer and the in¬ 
terconnected observers (N = 2). 



(d) Difference between estimation 
errors of Luenberger observer and 
that of the interconnected ob¬ 


servers (N = 2, m(t) = 0.3). 



(e) Tail of estimation errors with (f) Tail of estimation errors with 
low frequency sinusoid noise. high frequency sinusoid noise. 


Figure 4: Comparisons of estimation errors of the proposed observer and that of a Luenberger 
observer for different measurement noises. 


3.5 Design via feasibility/optimization problems 

The interconnected observers in (fTT| can be designed by solving feasibility and optimization 
problems that minimize the Hoo gain of the transfer function from measurement noise m 
to estimation error e (global) or e t (local) under the rate of convergence constraint. To 
formulate such problems, following [28], the error system in (fT3l) is rewritten as 

e = A e e + u, y e — C e e + m, = Xe, (24) 
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Table 1: Comparison of estimation error (e) of the observers with measurement noise of 
different frequencies. 


observer type 

low freq. noise 

high freq. noise 

H 0 o gain from m to e 

mean e 

std e 

mean e 

std e 

Thm. IQ 

Thm. l3T0l 

Luenberger 

0.2419 

0.0211 

0.2395 

0.0022 

0.8000 

0.8000 

interconnected 

0.2286 

0.0154 

0.2268 

0.0016 

0.7572 

0.4953 

improvement (%) 

5.5 

27.0 

5.3 

27.3 

5.4 

38.1 


where A e = I N <g> A, C e = —1^ <S> C , and the “input” u is assigned via u = M u y e with 
M u = JC * G T . Note that denotes the overall estimation error (or the local estimation 
error) of the interconnected observers, i.e., z^ — e with X — C (or = e, with X = C t ). 
In the s-domain, the transfer function from m to z x for (1231) can be written as 

T(s) = X{sl -A)~ 1 B + V 1 (25) 

where A = A e + M u C e , B = M u , and V — 0. Within this setting, feasibility (he., inequalities) 
and optimization problems associated with the design of the interconnected observers are 
formulated in the following sections. 

3.5.1 Rate of convergence and gain in terms of matrix inequalities 

To guarantee that the rate of convergence of the interconnected observers is better (or no 
worse) than that of a Luenberger observer, the eigenvalues of the error system in (fT3[) will be 
assigned to the left of the vertical line at —cr in the s-plane, where a is the rate of convergence 
for the Luenberger observer. Following [29] , the error system has all eigenvalues located 
to the left of — cr on the s-plane if and only if there exists a matrix P$ such that 

He(^4, Ps) + 2 aPs < 0, P$ = Pj > 0. (26) 

Note that (1251) is nonlinear because of the cross term F s (/C * G T ) obtained when expand¬ 
ing PsA. The following theorem provides an equivalent linear formulation and a sufficient 
condition for (]26|) . 

Proposition 3.8 Condition (1251) is satisfied if 

a) and only if H e(A e , Ps) + CjMj + M p C e + 2aPs < 0, 

Ps = Pj > 0, in which case M u = P^Mp,- 

b) the graph is all-to-all connected and there exists hi, / 12 GI such that the following condi¬ 
tions hold: 

b.l) h l + h 2 > a; 

b.2) Pi = Pfi > 0 for each i G V 
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b.3) He((/1 — K U C ), Pj) + 26-iPj < 0 for each i G V; 



2h 2 P\ 

oT 

°12 

s 12 • 

2h 2 P 2 

■ Sin 

S 2 N 

< 0 , where = —(KjiC) T Pj — P % K i3 C 

b.4) 


oT 

°1N 

q t 

°2 N 

2h 2 P]\r 



Proof Letting M p = PgM u , and using the definition of A, inequality (12H can be written as 


He(A e , P s ) + CjMj + M p C e + 2 aP s < 0, 


with Pg = PJ > 0. This proves item a). Now, assuming 6.1)-6.4) with hi, h 2 G K, note that 
the inequalities in 6.3) can be rewritten as 


' Qi 

0 • 

• 0 


' 2 hiPi 

0 

0 

0 

Q2 • 

• 0 

+ 

0 

2/11P2 

0 

0 

0 • 

• Qn 


0 

0 

2 hi P^ 


with Qi = He((7L — KuC ), Pf) for each i G V. By 6.2), symmetry of the inequalities (127)1 and 

6.4) , and the definition of negative symmetric matrices, the sum of the left terms of (127)1 and 

6.4) satisfies 


Q 1 

Sl2 ' 

• Sin' 


2(hi + h 2 )Pi 

0 

0 

^l T 2 

Q 2 

' S 2 n 

+ 

0 

2 (hi + h 2 )P 2 

0 

,^i T v 

SJn ' 

Qn 


0 

0 

• 2(hi + h 2 )PN 


< 0 , 

( 28 ) 


with Sij = —( KjiC) T Pj — PiKijC for i,j G V and j ^ i. Since hi + h 2 > a, (126|) is satisfied 
with P D = diag(Pi,..., P N ). ■ 


Proposition 3.9 Conditions 6.1)-6.4) in Proposition \3.8 1 hold if and only if there exist 
hi, h 2 , G M, Y t , Wij, Pi for i,j G V and j ^ i such that: 

a) hi + h 2 > a, 

b) Pi = Pj T > 0, for each i G V, 


c) He(A, Pi) - C T Y l T - YiC + 2h x P t < 0, for each i G V, 


2h 2 P\ R\2 

R 2 i 2h 2 P 2 


Rin 

R 2 n 


< 0, where R {j = -C T Wj t - W tJ C. 


Rm Rni ■ ■ ■ 2h 2 P;f 
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The conditions 6.3) -6.4) in Proposition Iff.ffl hold with K u = P t l Yi and K t] = P i 1 W i j for 

hi e v, jV *■ 

Proof Let Y) = PiKa and = P t Kij for i,j 6 V and j 7 ^ i, then, 6.3)-6.4) in Proposi¬ 
tion 13.81 can be rewritten as 


for each i 6 V and 


He(H, Pi) - C T Y t T - YiC + 2h 1 P i < 0 


2h 2 P\ R 12 

R. 2 \ 2h 2 p 2 


Rin 

R2N 


< 0 , 


Rm 


Rm 


2h 2 P N 


respectively. Therefore, c) and d) of Proposition 13.91 hold. 


3.5.2 Minimization of p[ 00 norm under rate of convergence constraint with fixed 
connectivity graph 

In this section, we consider the design of interconnected observer over a fixed digraph T = 
(V,£,G). The design specifications of onr interest are the rate of convergence and the 
gain from noise m to estimation errors e or e*, i.e., the C 2 gain. In particular, to guarantee 
that the rate of convergence of the system (TT3h is better (or no worse) than that of a single 
Luenberger observer as in (I2l) , the eigenvalues of the error system (U3l) will be assigned to 
the left of the vertical line at —a in the s-plane, where cr is the convergence rate for the 
Luenberger observer. 


Theorem 3.10 Given a plant as in © and a digraph P, the rate of convergence is larger 
than or equal to a and the global H oo gain (respectively, the local gain) from m to 
estimation error e in (fT3]i (respectively , e* in (1T2 ]) ) is minimized if and only if there exist 
matrices 1C, Ps, and Ph such that the following optimization problem has a solution: 


inf 7 

s.t. He (.A, Ps) + 2 aP s < 0 , 


(29a) 

(29b) 


' H e{A,P H ) 

PhB 

T t ' 



B t P h 

- 7 / 

0 

< 0 , 

(29c) 

X 

0 

-7 1 . 



cT 

A 

Ph = 

o' 

A 

(29d) 


where X = C (respectively, X = Cj andCi is the sub-matrix of C from the (in — n+l)-th row 
to the ( in)-th row). 
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Proof From [30] Theorem 2.41], the H^ gain for a system from input to output with 
realization 7\(s) = CfisI — Ai)~ l Bi is less than or equal to 7 if and only if there exists some 
Ph = P~h > 0 such that 


H e(A u P H ) PhB 1 CJ 
Bj P H -7 1 0 

Ci 0 - 7 / 


< 0 , 


(30) 


Then, for system (fT3l) with T(s) = C(sl — A)~ 1 B, we have that the global Hoo gain from m 
to e is less than or equal to 7 if and only if (l30l) holds with A\ — A, B\ — B and C\ = C, 
which leads to (I29cl) with X — C. The same argument applies for Tfis) = ~~ A)~ l B 

which leads to (I29cl) with X = C*. Then, the proof finishes by adding constraint (1261) . ■ 


Remark 3.11 For a fixed connectivity graph, the optimization problem in fl29l) can be solved 
offline. Moreover, due to the form of the observer at each node as in cn, the information 
needed by each agent is what the neighbors provide through the connectivity graph. Therefore, 
the resulting observers for each agent are decentralized. 


Note that the optimization problem (129|) is not jointly convex over the variables (P 5 , Ph, 
M u ). Moreover, it is nonlinear because of the existence of cross terms PhM u and PsM u . 
In order to remove the nonlinearities and make the two constraints jointly convex, following 
[28] . we reformulate the problem by seeking common solutions of P$ and Ph, and changing 
variables to M p := PM U . Using item a ) of Proposition 13.81 to rewrite the terms He(M, Ps) 
and Be(A, Ph) hi (l29]h we have the following result. 


Theorem 3.12 Given a plant as in © and a graph T, the rate of convergence is larger than 
or equal to a and the global gain (respectively, the local H 00 gain) from m to estimation 
error e in (fT3l) (respectively, <7 in (1T2 |) ) is minimized if there exist M p and P such that the 
following optimization problem (LMI) is feasible: 


inf 7 

s.t: H e(A e , P ) + CjM p + M p C e + 2aP < 0, 

M p X T 

-7 1 0 <0, 

0 — 7 / 

p = p T > 0, 


H e(A e ,P) + CjM p + MjC e 
Mj 

X 


where X = C (respectively, X = Ci and Ci is the sub-matrix of C from the (in — n+ 1 )-th row 
to the (in)-th row). 


Remark 3.13 The resulting observer gain matrix from Theorem \3.1 r z\ is given by M u = 
P~ 1 M p . By making the optimization problem linear and convex, a global optimizer is guar¬ 
anteed. However, asking for common solution of Ph = Pd 'may eliminate a better feasible 
solution to the original optimization problem in (1291) . 
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Following pH, [32], it is possible to formulate an equivalent convex optimization problem 
to the one in Theorem 13.121 but with noncommon Pjj and Ph matrices, see Appendix [F] 
Next, we provide an example to illustrate the results above. 


Example 3.14 We revisit the motivational example with connectivity graph as in Figured 
To further indicate the improvement obtained by the proposed observer, we choose Ku = 
K 22 = Kl = 2, and K V2 = —0.5 Kl = — 1. The resulting local gain from m to e\ 
is 0.55, which is smaller than that of the Luenberger observer, which is 0.8. If instead 
the connectivity graph in Figure 0 is considered, we can further optimize the parameters 
by solving the optimization problem (l29]h Feasible parameters for (1291) are found using the 
solver PENBMI JSf. For K u « 3.5198, K 22 » 0.4802, K V1 « -8.0142, K 21 « 0.2883, the 
resulting global gain is ~ 0.4953, which is ~ 38.09% smaller than that of Luenberger 
observer (which is 0.8 with Kl — 2). This improvement and the improvement obtained when 
using Theorem \3.4\ are listed in the last two columns of TableUi 

In fact, when the rate of convergence specification is a = 2.5, and the H 00 gain from m 
to e is restricted to be less than or equal to 0.8, then, by letting Ku = 2 and K 22 = 2, we can 
find the feasible region for K V2 and K 2 \ as shown in Figure 5(a). Moreover, if the rate of 
convergence is required to be a = 3.0 with the same H 00 constraint, then, by letting Ku = 2.5 
and K 22 = 2.5, we obtain the feasible region for K\ 2 and K 2 1 as shown in Figure 5(b). As the 


figure suggests, faster rate of convergence leads to a smaller feasible region for the observer 
parameters. More importantly, for a single Luenberger observer, there is no feasible solution 
for rate of convergence larger than or equal to 3.0 and global H 0Q gain less than 0.8. 




0.79 

0.78 

0.77 

0.76 

0.75 

0.74 

0.73 

0.72 

0.71 


(a) Regions for rate of convergence equal 2.5 (b) Regions for rate of convergence equal 3.0 


(■ K 12 = K 21 = 2). 


(K 12 = K 21 = 2.5). 


Figure 5: Feasible regions for observer parameters subject to different rate of convergence 
specification and global 11^ gain less than 0 . 8 . 


Now, for the same plant, consider digraphs with 6 agents where the edges are defined 
as in Figure 0 In all cases, each agent is self connected. Let Mi denote the number of 
non-self edges for agent 1, e.g., when Mi = 0 as shown in Figure 0 it is implied that 
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© © 

o o 

©O o© 

Ml = 0 

Figure 6: Different graph structures for agent 1 with N — 6. 

Table 2: Comparison of local H 00 norms from noise m to e\ with different number of incoming 
edges for agent 1. 



number of non-self edges (Mi) 

0 

1 

2 

3 

4 

5 

local 

0.80 

0.45 

0.34 

0.28 

0.25 

0.22 

improv. (%) 

0.00 

43.8 

57.5 

65.0 

68.8 

72.5 



G = I a, while when Mi — 5, G — [ g± g 2 ], gi — [1 lJ] T and, g 2 = [0 fi} T . Let the rate of 
convergence specification be a = 2.5. Then, the local H 00 norms from noise m = (mi,..., me) 
to estimation error e± at agent 1 for the cases in Figure 0 are shown in Table [E From case 
Mi = 0 to case Mi = 1, the improvement is significant; in fact, when an incoming edge 
is added to agent 1, the local H <*, is improved by 43.8% when compared to the case where 
a single Luenberger observer is used at agent 1. When two agents provide information to 
agent 1 (Mi = 2), the improvement is approximately 57.5% , while when three and four agents 
communicate to agent 1, the improvement grows to approximately 65% and 69% (Mi = 4 ), 
respectively. A 


(JTJ) with A = 


Example 3.15 (second order plant) First, we consider a second-order plant given as in 

4/100 "^3^ ’ ^ = [ ^ ] • For a given Luenberger observer with Kl = 

[1.5 —0.16] t , its rate of convergence is —3.34 and its H 00 norm from measurement noise m 

to estimation error is equal 0.34. With the interconnected observers for N = 2 connected 
via an all-to-all connectivity graph, we obtain that the optimal global Hoo norm from measure¬ 
ment noise m to estimation error e is approximately 0.05 and the optimal local H 00 norm from 
m to ex (or e 2 ) is 0.03 with M u = [vi v 2 ], where Vi = [10.3834 — 1.6019 — 10.7581 1.5963] T 
and v 2 = [7.1992 — 1.2410 — 7.3028 1.2426] T . This is ~ 85.29% smaller than that of 
Luenberger observers. 

Then, we consider a second-order plant with oscillatory behavior given as in (HD with 


A 


0 -1 

1 0 


C — [ 1 0 ] . For a given Luenberger observer with A© = [2 0] T , its 


rate of convergence is — 1 and its F[ 00 norm from measurement noise m to estimation error 
Pl is equal 2. With the interconnected observers with N = 2 connected via an all-to-all 
connectivity graph, by formulating the problem according to (1241) . the optimization problem 
in Theorem \3.10\ is solved and the gain matrix is found as M u = [nx v 2 \, where Vi = 
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[7.9503 - 9.9554 - 5.9424 9.0014] T and v 2 = [-5.9324 9.1143 7.9605 - 9.8426] T . Its 
corresponding global norm from m to e is ~ 1.4142 and its local norm from m to ey 
(or e 2 ) is ~ 1. Comparing to the Luenberger observer, the global norm is decreased by 
~29.3% and the local H 0Q norm is decreased bypz 50.0%. A 

The improvements on the local gain guaranteed by the proposed interconnected observers 
in the examples above are justified by the fact that the sufficient condition given in the 
upcoming Section [3761 are satisfied; see Theorem 13.201 and below it, where these examples are 
revisited. 


3.5.3 Minimization of norm under rate of convergence constraint with op¬ 
timized connectivity graph 

For interconnected observers whose digraph has not yet been specified, a natural question 
to ask is whether there exists a digraph that minimizes the number of links between agents 
for the given specifications. More precisely, given a rate of convergence a and a desired H 00 
gain 7 *, find a digraph with minimum number of edges. In applications, such minimizations 
could potentially lower the cost of a distributed system as it could reduce the number of 
agents and communication links. The following result provides a sufficient and necessary 
condition for such optimization problem. 


Theorem 3.16 For the error system f[T3lh the rate of convergence is larger than or equal 
to a and the global H 00 norm (respectively, the local H 00 norm) from noise m to estimation 
error e in (1T5D (respectively, e* in (fT2]l ) is less than or equal to 7 * over a digraph V with 
minimized number of edges if and only if there exist matrices 1C, G, Ps, and Ph such that 
the following optimization problem has a solution: 


inf tr(D) (31a) 

s.t. He(M, Ps) + 2 aPs < 0, (31b) 


' H e{A,P H ) 

PhB 




B t P h 

-7 *1 

0 

< 0 , 

(31c) 

X 

0 

-7 */ 



cT 

A 

Ph — Ph ^ 0 , 


(31d) 


where X = C (respectively, X = C). 


Proof Following the proof of Theorem 13.101 the global H a 0 gain over a digraph I is less 
than or equal to 7 * if and only if 


holds with Ai — A, B t — B, C\ = C, 7 = 7 * and 
Ph = Ph > 0. The same argument applies to the local gain. Moreover, the rate of 
convergence condition is satisfied if and only if (131b|) holds. Since tr (D) = YliLiY2j=i 9iji 
where g^ = 1 indicates there is an edge from node j to node i, then the number of edges of 
the graph is minimized if and only if tr(H) is minimized. ■ 
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The constraints in (131 bp and (j31c[) are nonlinear and not jointly convex. By changing 
variables, the nonlinear constraints in (13 lb[) and (131 cl) can be linearized as a function of Q 
and P. 


Theorem 3.17 For the error system (fT3l). the rate of convergence is larger than or equal 
to a and the global norm (respectively, the local H 0Q norm) from noise m to estimation 
error e in (1T51) (respectively, e* in (1T2]) ) is less than or equal to 7 * over a digraph T with 
minimized number of communication links if there exist matrices 1C, G, and P such that the 
following optimization problem is feasible: 

inf tr(T>) 

s.t. Ae(I N < 8 > A, P) - Q(I n < 8 > C) - ( Ijv < 8 > C) t Q t + 2aP < 0, 

' H e(I N < 8 )A,P)~ Q(I n ® G) - (I N <8> C) T Q T Q X T 

Q T - 7 */ 0 

X 0 - 7 */ 

p = p T > 0, 

where Q = P{)C * G T ), and X = C (respectively, X = Ci). 

Proof Let 1C, G and P be solutions of the optimization problem (1521) . Since the matrix 
1C * G J is such that Q = P(f C * G T ), using P = P T and the definition of A in (115]) . we have 

H e(I N ( 8 ) A, P) - Q(I n < 8 > C) - (I N ® C) T Q T 

= (I N ® A) t P + P{I N (8) A) - (. I N < 8 ) C) T (1C * G t ) t P t - P{K * G T ){I N ® C) = He(^, P ). 
Then, /C, G, Ps = P and Ph = P satisfy (151]) . ■ 

Remark 3.18 The results above define the graph via the resulting G. The resulting 1C and 
G from the optimization problem (1521) satisfies 1C * G J = P^ l Q, which may not be unique. 


(32a) 

(32b) 

< 0, (32c) 

(32d) 


Example 3.19 Consider the scalar plant in (J1P with a — —0.5 as in Example \S.1J\ which 
can represent the dynamics of a mobile agent whose state is to be estimated using multiple 
sensors either fixed or mobile (in relative coordinates). Suppose that the rate of convergence 
specification is a = 2.5. When using the graph that is all-to-all as shown in Figure 7(a), it 


is natural to ask the effect that the number of agents has on the improvement of the global 
tfoo norm. As shown in Figure\7(b)\ the resulting global F [ 00 gain is reduced as the number 


of agents N grows. These results are obtained following Theorem \3.16\ . The improvement 
is summarized in Table 0 Note that the improvement is less significant for N > 6 . In 
particular, the table indicates that if the global Hoa gain is required to be less than or equal 
to 0.40, then, as shown in Figure 7(b), the least number of agents needed is thre^. For 


D The optimization problem related to the examples shown in this paper are solved by PENBMI 
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(a) Graph structures with all to all connec¬ 
tions (self connection links are not shown). 
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(b) The global Hoo norm from noise m to estima¬ 
tion error e with respect to the number of agents. 
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Figure 7: The influence of the number of agents on the H 00 gain from noise m to estimation 
error e. 


Table 3: Comparison of global H 00 norms from noise m to e with different number of agents 
under all-to-all connection._ 



number of agents ( N ) 

1 

2 

3 

4 

5 

6 

7 

global Hoo 

0.80 

0.54 

0.40 

0.34 

0.33 

0.32 

0.31 

improv. (%) 

0.00 

32.5 

50.0 

57.5 

58.8 

60.0 

61.3 

local Hoo 

0.80 

0.38 

0.24 

0.23 

0.21 

0.20 

0.19 

improv. (%) 

0.00 

52.5 

70.0 

71.3 

73.8 

75.0 

76.3 


the same scalar plant with three interconnected observers, according to Theorem \3.16l we 
establish a relationship between tr (D) and the global gain from m to estimation error e 
in Table [7} In particular, for tr (D) smaller than six, there is no improvement in the H^ 
gain when compared to that of Luenberger observers. Moreover, the table indicates that, with 
three interconnected observers, if the global Hoo gain is required to be less than or equal to 
0.6, then the minimum number of links required in the connectivity graph T is seven. A 


Table 4: Comparison of global norms from noise m to e with different connectivity graph 
with N = 3. _ 



tr(D) 

6 

7 

8 

9 

global Hoo 

0.64 

0.53 

0.43 

0.40 

improv. (%) 

20.0 

33.8 

46.3 

50.0 
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3.6 A sufficient condition guaranteeing smaller local H ^ gain 

In this section, we are interested in conditions on the plant (JT1) for which it is possible to 
design interconnected observers that, for a given rate of convergence a, have local gains 
smaller than when a single Luenberger observer is used at each agent. Note that the local 
gain affects the quality of the estimates obtained at each node. These estimates can be 
computed efficiently and in a decentralized manner using local information, while computing 
the global estimate requires additional algorithms - see Section 14.21 The following result 
provides one such condition. 


Theorem 3.20 Given a > 0, suppose Kl is such that the eigenvalues of the error system 
([3]) of the Luenberger observer ([2]) for the plant (0Q) are located in the region V = {s E 
Co : Re(s ) < —o'}, and the gain from m to is 7 l > 0. If there exist del and 
P = P T > 0 such that 


Re(A-K L C,P ) PK l C -al n 
C T KJP -I n (1 + a)I n 

Qtln (1 T a)I n In 


< 0 , 


(33) 


then, for every N E N, N > 1, there exist a digraph T and a gain 1C for N interconnected 
observers in (fill) such that the error system (1131) has its eigenvalues in V and the local H 
gain from m to associated e* for all agents are less than or equal to r ) L . Moreover, for at 
least N — 1 agents, the local H 0Q gain from m to associated e* is strictly less than 7 l- 


Proof For any N > 1, let the digraph T have adjacency matrix 


G n = 


1 Ijv-i 
0 In-i 


(34) 


This choice of G indicates that agent 1 can share information with all other agents. Moreover, 
for each ieV, let T\ be the transfer function from m to <7. Take N = 2 and Kn = K 22 = Kl, 
K 12 = 0, and K 2 i to be determined later. Then, the interconnected observers in (fTTj) reduce 
to 


x 1 = Axi - K L (yi-yi), x 2 = Ax 2 - K L {y 2 - y 2 )-K 2 i{yi-yi), 

. _ X! + x 2 (35) 

yi = Cxi, y 2 = Cx 2 , x x = x^ x 2 = —-—, 

with associated error system as in (fT3j) with 


A = 

' A - K l C 0 

B = 

' Pf. 

0 


-K 21 C A - K l C 

5 ^ 

K 21 

k l 


If K l is such that ([2]) has its eigenvalues in P={sgC 0 : Re(s ) < — a}, then, due to the block 
matrix form of A, the eigenvalues of A are also in T>. Now, suppose (1331) holds with a£l 
and P = P T > 0. Then, if (133]) is treated as an constraint, equivalently, we have 


-a{sl - A L )~ l K L C + (1 + a)I 


< 1 . 

OO 


(36) 


24 















Therefore, the transfer function T 2 (s)=C 2 (sI—A) l B satisfies 


T 2 = C(sI-A)~ 1 B=-\l l\ 


si — Al 0 

-1 

Kl 

0 ' 

_ K 2l C si — A l 


k 21 

bi 

1 _ 


By using the inversion identity for a block matrix (inversion lemma), it follows that 


si - A l 

0 

-1 

^ —l 

1 

O 

1 

to 

O 

si - A l _ 


F (si — Al )- 1 _ 


where F = —(si — Al) 1 K 2 iC(sI — Al ) 1 Then, by assigning K 2 i = olKl, T 2 can be 
simplified as 

T 2 = [ \T l — | aT L CT L + | olT l ^T l ] , 
where Tl(s) = (si — Al)~ 1 Kl- Therefore, we obtain 

liniu < \ ll(i + <*)t l - ancTiiL + iipyu. 

Using (136|) . it follows that H^Hoo < = 7 ^. Now consider for any N > l,N e N, with 

digraph whose adjacency matrix is Gjy, it follows that the transfer function T t from noise m 
to e* satisfies T t = T 2 for all j e V, i ^ 1. Therefore, HTjUoo < 7 l for all i 6 V,i ^ 1. ■ 

Note that condition (full is a property on the plant for a given Kl ; basically, an Hoc inequality 
as in (I29ch . Next, we illustrate this condition in the examples throughout the paper. 

Example 3.21 For the scalar plant (J4J) with the Luenherger observer (J5]l . the transfer func¬ 
tion in the s-domain from m to is given by T L (s ) = ■ Since fl33l) is an LMI with 

respect to P and a, its feasibility can be easily verified, e.g., for a = — 0.5 and Kl = 2, 
P = 0.47 and a = — 0.5 solve (dSD- Therefore, for the plant (J3J), there exist interconnected 
observers such that at least N — 1 local H <*, gains are smaller than 7 l = 0.8 with K L = 2. 
This justifies the improvement shown in the motivational example as in TableUl A 


Example 3.22 We revisit the systems in Example \S.15[ For the first system discussed in 
Example Iff. 151 the improvement is justified by the fact that condition (|33l) in Theorem \3.20l 
holds with a = —0.3241 and P = 0.1/. While for the other second-order plant with oscillatory 
behavior, the improvement is justified by the fact that condition (|33l) in Theorem [3JM holds 
with a = 0.8631 and P = [wi w 2 \ with w\ = [0.1839 0.01 IT] T and w 2 = [0.0117 0.1722] T . A 
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4 


Discussion 


4.1 Optimizing the number of internal observers N 


The parameters of the interconnected observers are the gains K^s and the number of internal 
observers N. The optimization problems formulated in Section 13.51 optimize the gains for 
a prespecified value of N. Clearly, the larger the value of N is, the larger the size of the 
optimization problem to solve becomes. Moreover, it is expected that performance and 
robustness may only be improved up to a certain value of N, and increasing it further 
would not lead to a significant improvement. For a fixed rate of convergence constraint, 
Example 13. 1 91 shows the relationship between the optimal gain from m to e as a function 
of N for the scalar plant (j3J) with the interconnected observer via an all-to-all connectivity. 
Figure 7(b) suggests that, when N is larger than four, the improvement on the gain is 
not significant as its value tends to settle around a constant (~ 0.3). The objective function 
in the optimization problem in (I29c|) can be modified to include the number of internal 
observers N as an optimization variable. The objective function is given by Ci 7 g + c 2 N, 
where c\ > 0, C 2 > 0 are constant weights. With this new objective function, an optimization 
problem that optimizes the number of observers can be written as 


inf Ci7 5 + c 2 N 


s.t.: He(M, Ps) + 2 aPs < 0, 


He(M,P 5 ) 

PhB 

X 

B t P h 

-la 1 

0 

X T 

0 

-la 1 . 


< 0 , 


Ps = P] > 0, P H = Pi > 0, N 6 M, 


(37) 


where X = C (or Cj), Af C N and Af is bounded. It is worth to note that when c 2 = 0, the 
optimization reduces to the original one in (l29lh A way to solve problem (l37lh perhaps not 
efficiently, is by generating a collection of problems with N taking values from the finite set 
AT and then determining the one(s) with smallest gain. 


4.2 Consensus of the estimates in the nominal case 

The results in the previous section enable the design of interconnected observers as in (1TT| 
that meet specifications involving the rate of convergence, H 00 gains, and connectivity graphs. 
The local estimate could further be employed to determine a global estimate over the con¬ 
nectivity graph. Such an estimate can be obtained using consensus algorithms, in which case 
it will consists of a consensus problem of time-varying signals. When measurement noise is 
zero, the algorithm in [S3] can already be employed when generalized to the case of vector 
inputs. To this end, we attach to each agent an agreement vector and employ the following 
distributed algorithm to guarantee that each asymptotically approaches the average of the 
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local estimates, namely, ^ 

N N 

& =-Mg-*<+%, 4 = ft*EW, < 38 ) 

3 = 1 i =1 

for i G V, 1 < k < n, where £* = (f- ,... ,£f,... , £"); x/s are the estimates generated by 
agent i using the local observer in (TTTTh Vi is the auxiliary variable, and s are elements of 
the Laplacian C associated with the digraph h. The constants @2 G R are parameters to 
be determined. 

To analyze the convergence and stability of algorithm (l38lh following [23], it is rewritten 
as 

5 = ~/3i5 - fi 2 {C® I n )S - w, w = fafoiC® I n )8-U nN (x + fax), (39) 

where 6 = (<h, S 2 , ■ ■ ■, S N ), Si — & — A e V, and w — v - It niV (x + fdix). Following 

[2H Lemma 4.3], we obtain the following property. 

Lemma 4.1 For the plant in ©. assume the digraph V is strongly connected and weight 
balanced, where x t has the dynamics given in CD with mi = 0. Moreover, assume there exists 
/C in (1T5l) such that A is Hurwitz. Then, for any x(0),Xj(0),&(0) G R n , (3i > 0 ? /?2 > 0, 

and Uj(0) G R” such that YliLi Vi( 0) = 0, we have lim^oo YljLi %(£)) = 0 f or a ^ 

i G V. 

Proof The proof can be found in Appendix O ■ 

When the measurement noise m is not zero, due to the linear dynamics, we conjecture that 
the algorithm in (I38p has an ISS like property with respect to m, similar to the ICC bound 
in (fT6j) . Along with Theorem 13.201 such a property could potentially be used to characterize 
the improvement of the global H 00 guaranteed by the interconnected observers. 

4.3 Comparison between interconnected observers and the opti¬ 
mal observer/Kalman-Bucy filter 

It is well known that the Kalman-Bucy filter is the optimal observer that minimizes the 
mean square estimation error [34] • For the plant ([H) with x G R n , it is given by 

x K = Ax k ~ K(t)(y K - V ), (40) 

where xk G R n and t i —y K(t ) is the time-varying gain. Defining the estimation error as 
ex '■= Xk ~ x , the error system for the optimal observer/Kalman-Bucy filter i^ 

e K = {A - K(t)C)e K + K{t)m. (42) 

6 If (BUI) is initialized with xk{ 0) = %o '■= IE{a7(0)}, where E denotes the expected value function, then, 
according to [2] Theorem 4.5], for any positive definite symmetric weighting matrix function 1 K > W(t), the 
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While the expected value of the (weighted) norm of ex is minimized, the same trade off 
pointed out in Section [I] for Luenberger observers plays a key role in the design of (TUI . 
In fact, as page 346] correctly points out, “The optimal observer provides a compromise 
between the speed of state reconstruction and the immunity to observation noise.” Moreover, 
the design of (HOI) does not permit incorporating other performance indexes, such as the rate 
of convergence. On the other hand, the interconnected observers proposed here exploit the 
connections among agents over a graph to relax the constraints imposed by the said trade 
off and permits the incorporation of multiple objectives in the design. In fact, the results 
in this paper show that (140]) is not the optimal observer when performance specifications 
formulated in terms of eigenvalue constraints (relative to the optimal observer) are added. 
In this way, our results yield observers living in dimensions that are larger than that of the 
plant, leading to an approach in which the state estimation of systems in M n is performed 
using algorithms in W nN . 


5 Conclusion 

In contrast to standard observers for linear time-invariant systems, interconnected observers 
have the capability of attaining fast rate of convergence rate without necessarily jeopardiz¬ 
ing robustness to measurement noise in the Hoo sense. The comparison between ICC bounds 
between interconnected and Luenberger observers leads to checkable conditions that can be 
used for design - though potentially conservative. When solved for specific systems, the 
stated feasibility and optimization problems lead to significant improvements, when com¬ 
pared to single Luenberger observers. Such improvement is guaranteed by the satisfaction 
of an LMI condition. While the optimization of the number of internal observers and the 
connectivity graph are not necessarily linear and convex, numerical results for a particular 
plant indicate that the improvement obtained in robustness is significant only up to a finite 
number of such internal observers. 


References 

[1] D. Luenberger, “An introduction to observers,” IEEE Transactions on Automatic Con¬ 
trol, vol. 16, no. 6, pp. 596-602, December 1971. 

[2] H. Kwakernaak and R. Silvan, Linear Optimal Control Systems. John Wiley & Sons, 
Inc., 1972. 

gain that minimizes the objective function Efe# (t) T W(t)ex (i)} for all t > 0 is K(t) = QRjCAV^ -1 ^), 
where t^>-Q{t) is the solution of 

Q(t) = AQ(t) + Q{t)A T - Q{t)C T V^\t)CQ(t ), (41) 

from the initial condition Q(0) := E{(x(0) — xo)(x(0) — Xo) T }, where t Vd(t) is the (positive definite for 
each t > 0) covariance matrix of the measurement noise to. 



[3] S. Boyd, L. El Ghaoui, E. Feron, and V. Balakrishnan, Linear Matrix Inequalities in 
Systems and Control Theory. Siam, 1994. 

[4] H. Li and M. Fu, “A linear matrix inequality approach to robust H-infinity filtering,” 
IEEE Transactions on Signal Processing , vol. 45, no. 9, pp. 2338-2350, 1997. 

[5] J. Jung, K. Huh, H. K. Fathy, and J. L. Stein, “Optimal robust adaptive observer 
design for a class of nonlinear systems via an h-infinity approach,” in American Control 
Conference, 2006, June 2006, pp. 6 pp,-, 

[6] H. J. Marquez, “A frequency domain approach to state estimation,” Journal of the 
Franklin Institute , vol. 340, no. 2, pp. 147-157, 2003. 

[7] B. Egardt, Stability of adaptive controllers. Springer Verlag, 1979. 

[8] P. Ioannou and J. Sun, Robust adaptive control. Prentice Hall, 1996. 

[9] A. Astolb and L. Praly, “Global complete observability and output-to-state stability 
imply the existence of a globally convergent observer,” Mathematics of Control, Signals, 
and Systems (MCSS'), vol. 18, no. 1, pp. 32-65, 2006. 

[10] V. Andrieu, L. Praly, and A. Astolh, “High gain observers with updated gain and 
homogeneous correction terms,” Automatica, vol. 45, no. 2, pp. 422 - 428, 2009. 

[11] L. K. Vasiljevic and H. K. Khalil, “Differentiation with high-gain observers the presence 
of measurement noise,” in Proc. of 45th IEEE Conference on Decision and Control , Dec. 
2006, pp. 4717-4722. 

[12] J. H. Ahrens and H. K. Khalil, “High-gain observers in the presence of measurement 
noise: A switched-gain approach,” Automatica , vol. 45, no. 4, pp. 936-943, 2009. 

[13] A. A. Ball and H. K. Khalil, “Analysis of a nonlinear high-gain observer in the presence 
of measurement noise,” in Proc. of American Control Conference, July 2011, pp. 2584- 
2589. 

[14] R. G. Sanfelice and L. Praly, “On the performance of high-gain observers with gain 
adaptation under measurement noise,” Automatica, vol. 47, no. 10, pp. 2165-2176, 
2011 . 

[15] H. Lei, J. Wei, and W. Lin, “A global observer for observable autonomous systems 
with bounded solution trajectories,” in Proc. of 44th IEEE Conference on Decision and 
Control, Dec. 2005, pp. 1911-1916. 

[16] H. Shim, A. Tanwani, and Z. Ping, “Back-and-forth operation of state observers and 
norm estimation of estimation error,” in Proc. of 51st IEEE Conference on Decision 
and Control, Dec 2012, pp. 3221-3226. 


29 


[17] J. Cortes, “Distributed Kriged Kalman filter for spatial estimation,” IEEE Transactions 
on Automatic Control , vol. 54, no. 12, pp. 2816-2827, 2009. 

[18] R. Olfati-Saber, “Distributed Kalman filter with embedded consensus filters,” in Proc. 
of 44th IEEE Conference on Decision and Control, European Control Conference., 2005, 
pp. 8179-8184. 

[19] D. P. Spanos, R. Olfati-Saber, and R. M. Murray, “Approximate distributed Kalman 
filtering in sensor networks with quantifiable performance,” in Proc. of fth International 
Symposium on Information Processing in Sensor Networks, 2005. 

[20] P. Alriksson and A. Rantzer, “Distributed Kalman filtering using weighted averaging,” 
in Proc. of 17th International Symposium on Mathematical Theory of Networks and 
Systems, 2006. 

[21] R. Olfati-Saber, “Distributed Kalman filtering for sensor networks,” in Proc. of f6th 
IEEE Conference on Decision and Control, 2007, pp. 5492-5498. 

[22] R. Carli, A. Chiuso, L. Schenato, and S. Zampieri, “Distributed Kalman filtering based 
on consensus strategies,” IEEE Journal on Selected Areas in Communications, vol. 26, 
no. 4, pp. 622-633, 2008. 

[23] Y. Zhao, Z. Duan, G. Wen, and G. Chen, “Distributed 11^ consensus of multi-agent sys¬ 
tems: a performance region-based approach,” International Journal of Control, vol. 85, 
no. 3, pp. 332-341, 2012. 

[24] S. S. Kia, J. Cortes, and S. Martinez, “Dynamical average consensus under limited con¬ 
trol authority and privacy requirements,” International Journal of Robust and Nonlinear 
Control, 2013. 

[25] Y. Hong, G. Chen, and L. Bushnell, “Distributed observers design for leader-following 
control of multi-agent networks,” Automatica, vol. 44, no. 3, pp. 846-850, 2008. 

[26] J. Hu and X. Hu, “Nonlinear filtering in target tracking using cooperative mobile sen¬ 
sors,” Automatica, vol. 46, no. 12, pp. 2041 - 2046, 2010. 

[27] J. Hu, X. Hu, and T. Shen, “Cooperative shift estimation of target trajectory using 
clustered sensors,” Journal of Systems Science and Complexity, vol. 27, no. 3, pp. 413- 
429, 2014. 

[28] C. Scherer, P. Gahinet, and M. Chilali, “Multiobjective output-feedback control via LMI 
optimization,” IEEE Transactions on Automatic Control, vol. 42, no. 7, pp. 896-911, 
July 1997. 

[29] M. Chilali and P. Gahinet, U H 00 design with pole placement constraints: an LMI ap¬ 
proach,” IEEE Transactions on Automatic Control, vol. 41, no. 3, pp. 358-367, March 
1996. 


30 


[30] C. Scherer, “The riccati inequality and state-space -f^oo-optimal control,” Ph.D. disser¬ 
tation, Dissertation zur Erlangung des naturwissenschaftlichen Doktorgrades der Bay- 
erischen Julius Maximilians-Universitat Wurzburg, 1990. 

[31] Y. Ebihara and T. Hagiwara, “New dilated LMI characterizations for continuous-time 
multiobjective controller synthesis,” Automatica , vol. 40, no. 11, pp. 2003-2009, 2004. 

[32] Y. Ebihara and T. Hagiwara, “A dilated LMI approach to robust performance analysis 
of linear time-invariant uncertain systems,” Automatica , vol. 41, no. 11, pp. 1933-1941, 
2005. 

[33] M. Kocvara and M. Stingl, “PENNON: A code for convex nonlinear and semidehnite 
programming,” Optimization Methods and Software, vol. 18, no. 3, pp. 317-333, 2003. 

[34] R. E. Kalman and R. S. Bucy, “New results in linear filtering and prediction theory,” 
Journal of Fluids Engineering, vol. 83, no. 1, pp. 95-108, March 1961. 

[35] M. J. C. A. E. Frazho, Linear Systems and Control: An Operator Perspective. New 
York: Marcel Dekker, 2003. 

[36] H. K. Khalil, Nonlinear Systems. Pearson Education, Inc, 2002. 

[37] C. M. S. Mokhtar S. Bazaraa, Hanif D. Sherali, Nonlinear Programming: Theory and 
Algorithms. John Wiley & Sons, 2006. 

[38] P. B. T. Barsar, H°°-Optimal Control and Related Minmax Design Problems: A Dy¬ 
namic Game Approach. Birkhauser Boston, 1995. 

[39] B. D. O. Anderson and S. Vongpanitlerd, Network Analysis. Englewood Cliffs, NJ: 
Prentice-Hall, 1973. 

[40] W. Xie, “Multi-objective i/oo/a-stability controller synthesis of LTI systems,” IET Con¬ 
trol Theory & Applications, vol. 2, pp. 51-55(4), January 2008. 


A Proof of Proposition 13.3 


For a Hurwitz matrix A with distinct eigenvalues, we have the following properties [35] : 
(P2.1) |exp(Mi)| < k(M) exp(«(M)f), Vf > 0; (P2.2) Let <1>(t) = exp(Mi) for all t 6 M> 0 , 
then || < I ) ||i < |^pyy. Then, the solution of system (fi~3lh given by e{t) = exp(Mi)e(0) + 

f* exp(M(t — T))Bm{r)dT, can be bounded for all t > 0 as 


e{t) | < |exp(Mi)| |e(0)| + 


exp(M(t 


T))Bm{r)dT 


(43) 
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Let \4>(t) | 


f* exp(A(t — T))Bm{T)dr 


, then 


\4>(t)\ < \B\ f \exp(A(t - t))\ dr \m\oo < \B\ f |exp(M(T))| dr |m|oo < \B\ ||4>||i |m|oo. 

Jo Jo 

(44) 

Therefore, by using properties P2.1 and P2.2, inequality (l43li can be simplified as 

\B\ 

|e(t)| < k(A) exp(a(M)t) |e(0)| + k(A)-, — r - r - T |m| 00 . (45) 

HAl)| 

Furthermore, |e(t)| = |Ce(f)| < |C||e(£)| < k(A)\C\ exp(a(M)t) |e(0)| + k(A) |m|oo. Pick 
for s, t G M> 0 , /3(s,£) = k(A)\C\ exp(a:(M)£)s, <p(s) = k(A) s. It follows that (fl6|) holds. 

When A is dissipative such that A T + A < —2 al for some a > 0, following |35) Section 
3.2], we have (P2.3) *4.+ A T < 2p(M)/; (P2.4) | exp(Mi)| < exp(/i(M)t) for all t > 0; (P2.5) 
Let 4>(f) = exp(Mi) for all t > 0. Then, 114)111 < . Using properties P2.4 and P2.5, 

inequality (l43li can be simplified as |e(£)| < exp(/i(M)£) |e(0)| + \ m \oo- Then, if follows 
that |e(t)| = |Ce(£)| < |C||e(t)| < |C| exp(/i(M)£) |e(0)| + \m\co. For each s G M> 0 and 
t G M>o, dehne /3(s,t) = \C\ exp(/i(M)£)s, <p(s) = s. It follows that (TT6l) holds. 

When there exists P = P T > 0 such that A T P + PA < —2 aP for some a > 0, 
consider the Lyapunov function U(e) = e T Pe. Then, V has the following properties: (P2.6) 
A min (P)|e| 2 < V(e) < A max (P)|e| 2 ; (P2.7) (VV(e),Ae) < -2aA min (P)|e| 2 ; (P2.8) |VU(e)| < 
2A max (P)|e|. Moreover, the derivative of the function V(e) = e T Pe with respect to time is, 
for each e G R n , 

V (e) = <W(e),e) = (VU(e), Ae + Bm) = (VU(e), Ae) + (W(e), Bm). (46) 

Using properties P2.7 and P2.8 as well as the Cauchy-Schwarz inequality, we get that for 
each solution 1 H > e(t) to (TT3|) and each t H > m(t) 

V(e(t)) < -2aA min (P)|e(t)| 2 + 2A max (P)|e(t)||P||rn(t)| (47) 

for all t > 0. To claim the desired bound on e(f) from (IT7|h using similar steps as those in 
the proof of [[361 Theorem 5.1], we dehne W(t ) = (e(t)). It can be shown that, for all 

values of V (e(t)), 

D + W(t) < + %£®m(f)|. 

''max ) V A m i n (P) 

Then, by a comparison lemma (see, e.g., [36, Lemma 3.4]), for all t > 0, W{t ) satisfies the 
inequality 

W(t) <exp(-Ai)ir(0) + /‘exp(-A(( - r))|m(r)|(ir, 

y Amin (P ) J 0 
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where A = • Using property P2.6 and W(t) = \JV (e(t)), it follows that for all t > 0, 


e(*)| < 


Amax(-P) 

^min (P) 


e(0)| exp(—At) + 


A max (P)|£| 

A min (P)|A| 


Then, it follows that 


(48) 


|e(t)| = \Ce{t)\ < |C||e(t)| < J^^p||C||e(0)| exp(-At) + H°°- ( 49 ) 

For every s, t G M> 0 , dehne (3(s,t ) = exp(— Xt)s, <p(s) = Then, 

dEl) holds. 


B Proof of Proposition 13.7 

Let Ku = K 2 2 = K l . Dehne the function pi : [0, cx)) — > [0, oo) as p\{x) = 4||T(jo;)|j 2 , where 
x = oj 2 for all u G K. Then, 

(2 K l + K 12 + K 21 )W + [2 K l (K l - a) - a[K 12 + K 21 ) - 2 K 12 K 21 } 2 ax + e 

PlW _ u 4 + [2In - K L f + 2A',,A' 2l ]n' 2 + ((a - A’ t ) 2 - K a K 21 f bx 2 + cm + d’ 

(50) 


where a = (2AU + a) 2 , b = 1, c = 2(a — AU) 2 + 2/3, d = [(a — AT) 2 — /3] 2 , e = [2K L (K L — 
a) — aa — 2/3] 2 , where a = Ki 2 + K 2 i and j3 = K 12 K 21 . It can be proved that the function 
c o I —> pi(x(uj)) is pseudo-concave on [0, oo) for (AT 12 ,AT 2 i) € V i, where the nonempty set 
T>\ = {( Ki 2 ,K 2 i ) G M 2 : (ck,/ 3) G 5ip|d> 2 } with Si = {(a,/3) G M 2 : ec — ad > 0} and 
S 2 = {(cx,/?) G M 2 : /3 = 0}; see Lemma IB.II (for convenience, in the definition of Vi, we 
write the condition on A '12 and AT 2 i in terms of a and j3). Moreover, pi is an even function 
on M, with maximum attained at each uj such that Vpi(o; 2 )2ci; = 0. Therefore, by properties 
of extrema of pseudo-concave functions (see, e.g., [371, pagel06]), its maximum is at u = 0 
or at co such that 


VpTa; 2 ) 


—aa; 4 — 2ec o 2 + ad — ec 
( 1 'jj 4 + an 2 + d) 2 


(51) 


Since, using the assumption a — Kl < 0, c and d are positive on V 1 , equation (T5TT) is 
equivalent to aa; 4 + 2ea; 2 — ad + ec = 0, which for a > —2AT has roots at 


u l,2 = 


—e + \J e 2 — a(ec — ad) 


W 3,4 = 


—e — \J e 2 — a(ec — ad) 


(52) 


Recall that a > 0 and that on the set T>i we have ec — ad > 0, which, since d > 0 due to 
a — Kl < 0, implies e > 0. Therefore, the roots in (1521) are complex conjugate. Then, for 
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(Ki 2 ,K 2 i) G T>i fl {(A"i 2 , A 21 ) G R 2 : ck > —2K L }, pi attains maximum at oj — 0, and so 
does u ||T(jo;)|| 2 , be., \\T\\l = ||T(0)|| 2 = if 

Now, we show the existence of parameters K u , K 22 , A 12 , K 2 \ for which the property 
HTHoo < 11Tf,11oo holds. We claim that it holds for Ku = K 22 = Kl, (K V2 , K 2 \) G V := 
V x nV 2 , where V 2 := {(A 12 ,A 21 ) G R 2 : 0 > K 12 > max{-2 K L , }, A 21 = 0}. 

Take (AT 12 , Xf 2 i) G XL We have that ||T(jc<;)|| 2 is given by \ of the right-hand side of (15U1) 
with a = (2 K l + A 12 ) 2 , b = 1, c = 2(a — K L ) 2 , d = (a - K L )\ e = [2 K L {K L - a) - aK 12 } 2 . 
Then, 


NT I |2 _ 1 e _ 1 [2 K l (K l -a)- aK 12 } 2 
11 1,00 4 d 4 {a-K L y 

Furthermore, the inequality ||T'||^ 0 < ||Tl||;^ leads to 

1 [2 K L (K L - a) - aK 12 ] 2 ^ K 2 
4 (o -Kl)‘ < (a — Kl) 2 ’ 


(53) 


(54) 


which holds on V. 

On the other hand, since Ku = K 22 = K L , when (Xf 12 , ih 21 ) G T>, the rate of convergence 
of the observer in ([6]) is \a — K L | by substituting these parameters in (117)) . which coincides 
with that of the Luenberger observer in (J5j). 

Note that the case where a > 0 and a — Kl < 0 can be proved similarly. 


Lemma B.l For system (JHJ) with Ki = K 2 = Kl such that a — Kl < 0, whose transfer 
function from m to e is T(s) = C(sl — A) l B, for each (K 12 ,K 21 ) G V 1 := {(Xf 12 , Xf 21 ) G 
R 2 : [2a 2 (a - K L ) 2 - (a - K L f\ (A 12 + A 21 ) 2 + [8 aK L [a - K L f - 4 K L (a - K L f} (A 12 + K 21 ) + 
4 K\[a — K l ) a > 0, K 12 K 21 = 0}, the function u H- ||T(ja;)|| 2 is pseudo-concave on [0, oo). 


Proof With the definition of A, B, C and K\ = K 2 = Kf , the transfer function T can be 
computed as 

T(s ) = C{sl - A)- 1 !! 

1 (2 K l + K 12 + K 21 )s + 2K l (K l - a) - a(K 12 + K 2l ) - 2K 12 K 21 

2 s 2 2(a - K l )s + (a - K L ) 2 - K 12 K 21 

Let a = K\ 2 + K 2 \ and /3 = A 12 Xf 21 , then we have 


1 (2 Kl + a)s + 2 Kl(Kl — a) — aa — 2/3 

2 s 2 - 2(a - K l )s + (a - K L ) 2 - P 


It follows that 


I TOw) 11 2 


1 (2 K l + a)W + [2 K L (K L - a) - aa - 2f3} 2 
4cu 4 + [2(a - K l ) 2 + 2/3]cu 2 + [(a - K L ) 2 - f3} 2 ' 


7 Since T L {s) = s Jff Kh and \\T L {ju)\\ = ^ 2+ ^_ a)2 > it follows that WT^ = sup weR \\T L (jw)\\ = 
k l 

Kl—CL ' 
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Note that pseudo-concavity of w ^ ||T(jo;)|| 2 is equivalent to pseudo-convexity of uj H > 
— ||T(jcc;)|| 2 . Define the function p : [0, oo) (->■ (—oo,0] as p(x ) = —4||T(jo;)|| 2 , where x = c j 2 
for all uj G M> 0 . Note that since uj H > ||T(//'a;)|| 2 is an even function on M, it is enough to 
discuss the case where uj > 0. Furthermore, let a = (2AD + a) 2 , b = 1, c = 2(a — AD) 2 + 2/3, 
d — [(a — Kl ) 2 — /3] 2 , e = [2AD(AD — a) — aa — 2/3] 2 , which will be treated as functions of a 
and /3, but for simplicity of notation, we do not explicitly write that dependency. Then, 

. , ax + e 

pix) = ——-- -. 

bx 2 + cx + d 


For any two points xi,x 2 G M> 0 , Vp(xi)(x 2 — Xi) > 0 can be written as 


— ax 2 — 2exi + ad — ec 
(bx\ + cx i + d) 2 


(x 2 - xi) > 0, 


which implies that 


(ax 2 + 2exi + ec — ad)(x 2 — Xi) > 0. 


(55) 


Since a > 0 and e > 0, then for each ( a , j3) G <Si := {(a, j3) G M 2 : ec — ad > 0}, (|55l) implies 
that x 2 > x i. Now, p(x 2 ) — p(x i) can be evaluated as 


P(x 2 ) - p(x i) 


ax 2 + e 


+ 


axi + e 


x 2 + cx 2 + d x 2 + cxi + d 


ax ix 2 + e(xi + x 2 ) + ce — ad 
(x| + cx 2 + d)(x 2 + cxi + d) 


(X 2 - Xl) 


(56) 

(57) 


Recall that the minimum of quadratic function x\ + cxi + d on M is attained at the point 
xi = — | and that the actual value of the function is — y + d. Therefore, for (ck,/3) £ 
S ‘2 := {(a,/3) G M 2 : — ^ + d > 0}, (x| + cx 2 + d)(x 2 + cxi + d) >0. Moreover, if 
(ck,/3) G «Si, using the property xi > 0, axix 2 + e(xi + x 2 ) + ce — ad > 0 since it is lower 
bounded by ax 2 + exi + ce — ad. Then, for any (a,/3) G «SiP|«S 2 , Vp(xi)(x 2 — Xi) > 0 
implies p(x 2 ) — p(xi) > 0. Therefore, by definition of pseudo-convexity, the function p is 
pseudo-convex on [0, oo) for each (a,/3) G «Si f) d> 2 . 

To show that the set <Si n^ 2 is nonempty, consider the special case where /3 = 0. Using 
the definitions of c and d, the set S 2 leads to the smaller set S 2 := {(a,/3) G M 2 : /3 = 0}. 
By using the definitions of a, c, d, e, (a,/3) G Si p| S 2 implies that a should satisfy [2a 2 (a — 
AD) 2 — (a —AD) 4 ]ck 2 +[8aAD(a —AD) 3 — 4A’i(a —AD) 4 ]a + 4A'2(a —A'l) 4 > 0. This condition 
can always be satisfied for some a since 4A" 2 (a — AD) 4 > 0. Thus, Si(~)S 2 is nonempty, 
which implies that Si p| S 2 is nonempty. Note that T>i = {(A'i 2 ,A" 2 i) : (a,/3) G Sip|S 2 }. 
Then, for each (AD 2 , ADi) G V i, —p is pseudo-concave on [0,oo). Moreover, it can be easily 
verihed that the composition p(x{uj)) is pseudo-convex on the set {co G M : uj > 0}, where 
x{uj) = uj 2 . In fact, since 


d 

duj 


■p{x{uj)) = 2 V p{x)uj, 


(58) 
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and for uj\,uj 2 G M> 0 , p{x{uj 2 )) — p{x{uji)) = p(x 2 ) — p{x i) with Xi = u>^ for i G {1,2}, by 
similar arguments as above, we have that 

-^p(x(u i))(cu 2 - u>i) > 0 (59) 

implies p{x{uj 2 )) — p(x{l>Ji)) > 0. Thus, for each (K 12 ,K 2 1 ) G Tb, a; i—)• p(x(a;)) is pseudo- 
convex on [0,oo); hence, for each (Ki 2 ,K 2 i) G T? l5 a; ||T(ja;)[| J is pseudo-concave on 
[0, oo). ■ 


c 


Proof of Lemma 




Consider the fc-th element of x t as with the algorithm in (155} . Let 5 k = (5}, <5f, ..., <5^), 
w k = (w k , w k ,... , lujy), = (u}, v k ,..., n^r) and x k = (x k , x k ,..., x%). We can rewrite (155} 
as 


= -/? 1( 5 fc - /3 2 £h fc - w fc , (60) 

w k = p 1 p 2 £S k -n N (i k + (hi k ), (61) 


where 


<? = ef - 4 X>‘>' 6 v - «,* = „*- n„(i* + ft**). 

J=1 

Then, we obtain 

0 

iM£ fc + /3ii fc ) J ’ 


,( -o 

1_ 

= Afc 

1_ 


w 


w 



where 


~PiIn ~ foil —In 

P 1 P 2 C 0 


(62) 


(63) 


(64) 


Applying (24} Lemma 4.3], we have lirn^oo (^{t) — jj YljLi = 0 f° r all 2 G V. Fur¬ 

thermore, this is true for all k G {1,2,..., n}. Therefore, the claim in Lemma is proved. 


D On two uncoupled Luenberger observers 

It should be noted that simply using two Luenberger observers without any coupling and 
taking the average of their estimates will not lead to both faster convergence rate and smaller 
steady state error. In fact, when Ki 2 = K 2 1 = 0, the system in ((6} proposes a structure 
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of two Luenberger observers without coupling. A direct calculation shows that for constant 
noise, the estimation error e satisfies 


e* 


1 ( K x I< 2 \ 

--- m 

2 \Ki — a K 2 -aJ 


Suppose the gain for the Luenberger observer in (J3]) is K L > 0 and a — K L < 0 for stability, 
where a — Kf denotes the rate of convergence of the Luenberger observer. To guarantee 
stability and that the rate of convergence of the proposed observer is no worse than that 
of a Luenberger observer, it is necessary to have K\ > A A and Ah > K L . It can be easily 
verified that in such a case 


1 / K\ K 2 \ 

2 \Kx — a ' K 2 — a) 


K l 

K l - a 


> 0 . 


(65) 


Thus. |e*| > e) as 4 (-J 1 -—h } > 0 and , hL > 0. 

’I I — l -Al 2 \Ki—a K 2 —CL J Kl—ci 

E Bound of gain as an inequality constraint 

To guarantee that the H^ from m to e is as small as possible, the bound of the transfer 
function T in the s-domain should be minimized, namely, we look for the minimum 7 > 0 
such that \T(juj)\ < 7 for all ugK, namely, we minimize the C 2 gain. [f| 

Lemma E.l Uil lU). Theorem 2-41] For the transfer function fl25l) defined hy (A, B, C, V), 
the following statements are equivalent. 

a) The system is stable and the H^ gain of the system is less than 7 for some 7 > 0, be., 

imu<7, 

b) There exists Ph = Pjf > 0 such that 


H e(A,P H ) 

PhB 

C T ' 



B t P h 

-7 1 

V T 

< 0 . 

( 66 ) 

C 

V 

- 7 1 




Remark E.2 The condition in item b ) of Lemma \E.l\ is the so-called Bounded Real Lemma 
condition; see, e.g., fA9[ \3Dj . 


8 Such a bound guarantees that |zoo(t)| 2 <A < y 2 / Q °° \m(t)\ 2 dt, and 7 is the C 2 gain, where to € C 2l 
the so-called Hoc gain [38] . 
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F Dilated LMI formulation for the interconnected ob¬ 


servers 

Proposition F.l Given a > 0 ; the rate of convergence of error system (l24j) is greater 
than or equal to a and the H <*, gain from m to e is less than or equal to 7 if there exist 
real matrices Ps, Ph, Qd, Qh and real numbers r D > 0, r H > 0 such that the following 
optimization problem (LMI) is feasible: 


inf 7 


He (.A, Q d ) + 2 aP s 

Ps ~ Qd + i^dA. t Qd 

< 0 , 



(67a) 

Ps — Qd + r dQJ)A 

—r d(Qd + Q~b) 



He (A., Qh) 

Ps ~ Qh + r H A T QH 

QhB 

C T ' 



Ps ~ Qh + r H QjjA 

—th^Qh + Qh) 

r H QhB 

0 

<0, 

(67b) 

B t Q h 

vhB t Qh 

-l 1 

0 

C 

0 

0 

- 7 1 . 



o' 

A 

bco 

^-1 

> 0 . 




(67c) 


Proof The proof follows from [401 Theorem 1 and Theorem 2], see also [311132j . ■ 
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